WAVE IMAGE COMPRESSION 



Cross Reference to Related Applications 

This application claims the benefit of U.S. Provisional Application Serial No. 60/171,509, 
filed on December 22, 1999. 
Federally Sponsored Research and Development 

This invention was made in part with government support under contract no. DAAB07-97-D- 
H754 awarded by the U.S. Army. The U.S. Government has certain rights in this invention.. 
Technical Field of the Invention 

The present invention relates to the field of digital data compression and decompression in 
particular to a compression and decompression method that implements combined discrete wavelet 
transformation, and a method of data packing. 
Background of the Invention 

The use of digitized multimedia, such as motion video and still images, has increased the 
demand on microprocessors and available bandwidth. The use of World Wide Web browsers as 
graphical user interfaces and electronic commerce on the Internet has increased the need for 
graphical images that are visually appealing and are of high resolution. Unfortunately, high image 
quality creates a demand for increased storage space for digital data. Industry has recognized the 
need for compression of the digital data to help reduce the problem. Compression is a process 
intended to yield a compact digital representation of a signal. In the cases where the signal is defined 
as an image, the problem of compression is to minimize the number of bits needed to digitally 
represent the image. There are many applications that benefit when image signals are available in 
compressed form, such as digital photography, electronic commerce, digital video processing and 
archiving digital images for on-line catalogs. Without compression, most digital information and 
transmission through the normal limited bandwidth channels is difficult or infeasible for practical 
use. For example, consider the case of facsimile transmission. Typically, an 8.5 x 1 1 inch page is 
scanned and digitized at 300 dots per inch, thus resulting in 8.5 x 1 1 x 300 x 300 - 8415000 bits. 
Transmitting this data via a low-cost 14.4 Kbps modem could require 9.74 minutes. 



1 



Compression of digital data is the process of representing this data using as few bits or bytes 
as possible. Generally, there exist two types of image compression for digital data - lossless and 
lossy. Lossless compression does not lose any data in the compression and allows image data to be 
stored using less memory, such as Random Access Memory (RAM), than an uncompressed image 
with the ability to restore the original data exactly. Lossy compression further reduces the amount of 
needed memory, but does not guarantee exact restoration of the original data. Existing technology 
for lossless compression, however, may not allow for high compression ratios. If the electronic data 
signal is an image, the differences between the original and the lossy compressed restoration may not 
even be visually noticeable for low compression levels using existing compression technology. 

There exist several lossy compression methods available for image compression. The Joint 
Photographic Experts Group (JPEG) has provided one of the most popular still image compression 
technologies available today. Other file formats include PCX, GIF and BMP. The IS 1 09 1 8- 1 (ITU- 
T.81) standard for image compression is the result of JPEG and is usually referred to as the JPEG 
standard. This standard combines the discrete cosine transform (DCT) and extensive Huffman tables 
to produce a compression algorithm. Since JPEG has an underlying DCT based technology, it 
operates on eight by eight blocks of pixels. Although JPEG is popular, one known problem is that, 
as the compression level increases, the image quality worsens and distortion grows in these blocks 
individually. This problem leads to a block effect that introduces edges into the image, which is 
normally detected as jagged edges or maybe a blurring to the observer of the decompressed image. 
Because storage space or bandwidth may be limited, additional space or bandwidth is costly or 
unavailable, high compression, (such as 100:1), is generally preferred than lower compression of 
data. 

Since the development of digital signal processing in the early 1980's, a digital form of 
wavelet transform called Discrete Wavelet Transform (DWT) has become an important tool for 
image processing and image compression. DWT is a lossless transform, which is used to form an 
orthonomal basis of some, and a dilated master function over a range of, shift and dilation 
parameters. The principle behind the wavelet transform is to hierarchically or recursively decompose 
the input signals into a series of successively lower resolution reference signals and their associated 



detail signals. At each level, the reference signals and detailed signals contain the information 
needed for reconstruction back to the next higher resolution level. One-dimensional DWT (1-D 
D WT) processing can be described in terms of a Finite Impulse Response (FIR) filter bank, wherein 
an input signal is analyzed in both low and high frequency subbands. 
5 A separable two-dimensional DWT process is a straightforward extension of 1-D DWT. 

Specifically, in the 2-D DWT process, separable filter banks are applied first horizontally and then 
vertically. Referring to FIG. 1 , the 2-D DWT process decomposes an image into a reference signal, 
and three detail signals. The signals from the filter bank comprising two filters, first horizontally 
then vertically, gives rise to an analysis in four frequency subbands: subband LL, horizontal low - 
1 0 vertical low as known as a reference signal; subband LH, horizontal low - vertical high; subband HL, 
horizontal high - vertical low; and subband HH, horizontal high - vertical high. Each resulting band 
is encoded according to its unique information content for transmission from a coding station to a 
O receiving station. Constraints, however, exist on how filters can be designed an/or selected, 
■.. j including the need to output perfect reconstructions, the finite-length of the filters and a regularity 
jL; 1 5 requirement that the iterated low pass filters involve convergence to continuous functions. 
■£} Thus what is needed is a method and apparatus of compressing and decompressing image 

^ data that overcomes the problems in prior art. There is also a need to perform high compression of 
; : : data and at the same time redisplay the underlying image at high visual quality. 

Summary of the Invention 
= ,s 20 In view of the foregoing the present invention is directed to a system and method of image 

~* compression. 

In a first embodiment of the present invention, a method of image compression, comprises 
the steps of recursively transforming an image using a Discrete Wavelet Transform. This creates a 
plurality of levels including at least a first level, multiple intermediate levels, and a low-low pass 
25 subband of the last level. The transformed image at each level is quantized, and datapacking the 
quantized image is performed. The step of datapacking further includes, encoding of the first level 
using adaptive run length coding of zero coefficients combined with Huffman codes; encoding of the 
intermediate levels using run-length coding of zero coefficients and a predetermined two-knob 
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huffman table for non-zero coefficients and encoding of the low-low pass subband using a low 
frequency packing algorithm. 

In another embodiment of the invention, an encoder of compressing input data from an image 
comprises a two-dimensional discrete wavelet filter for transforming the input data into plurality of 
5 coefficients. The filter forms a first level, intermediate levels, and a low-low subband of a highest 
level of transformation. A quantizer maps the coefficients into discrete regions by a predetermined 
compression parameter. A datapacker compresses the mapped coefficients. The datapacker encodes 
a plurality of zero coefficients at the first level by adaptive run length coding, a plurality of non-zero 
coefficients at the intermediate levels by a two-knob Huffman coding and the low-low subband at the 
1 0 highest level by low frequency coding. 

These and other objects, features and advantages of the present invention will be apparent 
upon consideration of the following detailed description thereof, presented in connection with the 
following drawings in, which like reference numerals identify the elements throughout. 
Brief Description of the Drawings 
1 5 FIG. 1 is a diagram illustrating divided frequency bands from a process of two-dimensional 

discrete wavelet transform; 

FIG. 2 is a block diagram of a coder and decoder according to the present invention; 

FIG. 3 is a flow chart of an embodiment of a method of compression according to the present 
invention; 

20 FIG. 4 is a diagram showing wavelet subbands according to the present invention; 

FIG. 5 is schematic diagram showing analysis of a Discrete Wavelet Transform according to 
the present invention; 

FIG. 6 is a schematic diagram showing synthesis of a Discrete Wavelet Transform according 
to the present invention; 
25 FIG. 7 is a diagram of a quantized image signal according to the present invention; 

FIG. 8 is a flow chart an embodiment of a method of adaptive run length coding according to 
the present invention; 

FIG. 9A is a diagram of an input data for the method in FIG. 8; 
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FIG. 9B is a diagram of an output data from the method in FIG. 8 according to the present 
invention; 

FIG. 1 0 is an embodiment of a huffman table according to the present invention; 
FIG. 1 1 is a flow chart of an embodiment of a datapacking method according to the present 
5 invention; 

FIG. 12A is an embodiment of a data structure according to the present invention; 

FIG. 12B is an embodiment of a second data structure according to the present invention; 

FIG. 13 is a flow chart of an embodiment of a datapacking method including two-knob 
huffman coding according to the present invention; 
1 0 FIG. 1 4 is an embodiment of a two-knob huffman table according to the present invention; 

FIG. 15 is a diagram of a low-low pass subband and labeling convention according to the 
present invention; 

FIG. 16 is a diagram an encoding order according to the present invention; 

FIG. 17 is a chart of Peak Signal to Noise Ratios for different levels of compression 
1 5 according to the present invention; 

FIGS. 18A-18C are examples of the compressed images using JPEG and the present 
invention at a moderate compression level illustrating the blocking effect; and 

FIGS. 19A-19C are examples of the compressed images using JPEG and the present 
invention illustrating at a relatively high compression level illustrating the blocking effect. 
20 Detailed Description of the Invention 

There is shown in FIGS. 1-16 an illustration of an embodiment of the present invention, a 
method and apparatus for processing a image data. 

Referring to FIG. 2, the method of the embodiment uses three steps for the encoder of image 
compression 20- forward wavelet transform 22, quantization 24, and data packing 26. Each of the 
25 steps will be described in detail herein. 

In a first embodiment of the invention, digital image data 10 having pixel values is provided 
to a wavelet transform process 22. Wavelet transform process 22 uses a multiple level wavelet 
transform, utilizing symmetric biorthogonal seven and nine tap filters. Decimation and interpolation 
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procedures are incorporated in the wavelet transform. This allows the forward and inverse transform 
to be done with about half the number of multiply adds. Also, the image is mirrored at the edges 
before transformation, providing better image reconstruction from the quantized wavelet coefficients. 
Wavelet data is the output data of wavelet transform process 20. 
5 Next a quantization step 24 is preformed on the wavelet data in which a uniform scalar 

quantization technique employs a dead zone around zero. According to the present invention, this 
technique preserves the structure of the wavelet data, allowing efficient compression. The dead zone 
at zero helps to increase compression without introducing large amounts of distortion. The 
compression level is adjusted by a user-controlled parameter CR that effects quantization step 24. 

10 The quantized wavelet coefficients are provided into a datapack step 26. According to the 

present invention, datapack step 26 compresses the quantized wavelet coefficients using different 
techniques. The inventive techniques use adaptive run length coding, Huffman coding, and low 
frequency data packing. The specific type of technique for encoding the wavelet coefficients is 
applied for the resolution level of wavelet coefficients being compressed. 

15 The decompression procedure 40 is the inverse of the compression procedure 20. The 

compressed data is provided to a unpack step 42. This step includes the lookup of Huffman words, 
the decoding of the run length codes, and other data unpacking techniques. The unpack step 42 
reproduces the quantized wavelet coefficients. The quantized wavelet coefficients are then provided 
to a de-quantization step 44. De-quantizion step 44 involves the inverse process of the quantization 

20 step 24. 

The final step in the decompression procedure 40 is to inverse wavelet transform 46 the de- 
quantized wavelet coefficients. Inverse wavelet transform 46 step produces the pixel values that are 
used to create the visual image. Some normalization of the pixel values is preformed due to the 
rounding and quantization error involved in the forward wavelet transform step 22 and inverse 
25 wavelet transform step 46. The reconstructed image data 12 is then displayed to the user by any 
known image display hardware for digital data. 

Forward Discrete Wavelet Transform And Inverse Discrete Transform 

Forward wavelet transform 22 hierarchically decomposes the input signals into a series of 
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successively lower resolution reference signals and their associated detail signals. At each resolution 
level, the reference signals and detail signals contain the information needed to be reconstructed back 
to the next higher resolution level. The one-dimensional DWT (the separable 2-D case is a 
straightforward extension) may be described in terms of the filter bank. DWT is related to Sub-band 
5 Coding & Quadrature Mirror Filter (QMF), and to the Laplacian Pyramid (LP) in Computer Vision. 

H 0 is denoted as a low pass filter and Hi is denoted as a high pass filter in the "Analysis" 
process. In the Synthesis operations, G 0 is denoted as a low pass filter and Gi as a high pass filter. 
The filtering function in digital signal processing is a convolution operation. The basic Multiple 
Resolution Analysis (MRA) in terms of QMP is used in the present invention. An input signal F(z) 

1 0 is input to the analysis low pass filter H 0 (z) and the analysis high pass filter Hi(z). The odd samples 
of the filtered outputs may be discarded, corresponding to decimation by a factor of two. The 
decimation outputs of these filters constitute the reference signal ri(z) and detailed signal di(z) for a 
one level decomposition. For synthesis process (reconstruction), interpolation by a factor of two is 
performed and followed by filtering using the low pass and high pass synthesis filter G 0 (z) and Gi(z). 

1 5 Constraints on filter design include perfect reconstruction (lossless in terms of image quality), finite- 
length (finite number of taps in the filter with no feedback), and regularity (the filter convolved upon 
itself will converge) so that the iterated low pass filters may converge to continuous functions. 

In one embodiment, to provide a lossless QMF design, the discrete filter bank theory to the 
scaling H 0 filter is applied via the standard z-transform: 

20 Ho(z) = ^ c k z~ k and likewise a wavelet Hi filter: H x (z) = ^ d k z~ k . 

£=0 k=Q 

Applying the theory of filter banks, it is noted that to eliminate aliasing, the following 
relationship must be satisfied: 

G 1 (z)=(-ir i // 0 (z) 

jy I (z)=(-i)"G 0 (z) 

25 The following equations may be obtain by appropriate substitution of the above equations: 

F(z)=F*(z). 
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As known in the art, the 1-D DWT may be extended to two-dimensional DWT (2-D DWT). 
The analysis of 2-D DWT is shown in FIG. 5, and the synthesis of two-dimensional DWT is shown 
in FIG. 6. In a 2-D DWT, separable filter banks are applied first horizontally and then vertically. 
5 Application of a filter bank comprising two filters, first horizontally then vertically, gives rise to an 
analysis in four frequency bands; horizontal low-vertical low, horizontal low-vertical high, 
horizontal high-vertical low, horizontal high-vertical high. Each resulting band is encoded according 
to its own statistics for transmission from a coding station to a receiving station. The following 
equations are used to develop the filter. 
10 F al =x{z)H 0 (z) 

F a2 =x{z)H x {z) 





( \\ 




( \\ 




f 


\\ 




r 




X 


z 2 




z 2 


+ x 




z 2 






z 2 




K J 




K ) 






J 






) 




( ° 




( ° 




f 


1 N 




f 


1 \ 


X 


Z 2 




z 2 


+ x 




z 2 






z 2 




V ; 














V 





F a5 =±{x(z)H 0 (z)+x(-z)H 0 (-z)} 
15 Fab = \ {x{z)H(z) + x(- z)Hx{- z) } 

The present invention includes performing a DWT on an image using computation methods 
such as convolving the image and the filters. Discrete convolution is a multiply and add procedure 
using the coefficients of the filter, and the image pixel values. Due to the convolution, there is a 
20 transient effect that occurs at the image edges. To reduce this effect, the present invention pads the 
image at all edges. This padding is a mirror reflection of the image. This padding helps prevents the 
transient affect from distorting the image when performing the inverse DWT in step 46 on the de- 
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quantized coefficients. Without quantization, the wavelet transform is a lossless function, but the 
distortion caused by the quantization can cause undersized losses in image reconstruction. Padding 
advantageously reduces this effect. 

In one embodiment, the D WT transform process is simplified due to compression algorithm 
5 20 using symmetric, odd length filters on image data 10 that is constrained to have dimensions that 
are multiples of two raised to the number of wavelet levels (2 levels ). Because the output for each pixel 
occurs when the filter is centered on that pixel, the number of pixels that need to be mirrored is equal 
to the length of the filter minus one, divided by two. 

The mirroring of a row of image pixels is illustrated below. If an image has a row of pixels, 
10 ao to a n .i. and is convolved with a filter of length 5, then the mirrored row would be symbolically 
denoted as the following: 

a2 aj | ao ai a2 a3. . ♦ a n .3 a n _2 a n _i | a n _2 a n .3 where n is the number of pixels. 
A convolution of a row with a filter of length 3, is demonstrated as below: 

If the filter is defined by F = [A B C] and the row is defined by row - [ 1 2 3 4 5 6 7 8 ] then 
15 the mirrored row should be constructed as 2|1 2 3 4 5 6 7 8|7. After convolution, the resulting 
coefficients are as follows: 

A2+B1+C2, A3+B2+C1, A4+B3+C2, A5+B4+C3, A6+B5+C4, A7+B6+C5, A8+B7+C6, 
A7+B8+C7 where the transient coefficients are not included. Only the results of the convolution 
when the filter is centered on the pixel are kept. 

20 After performing the convolution, a decimation operation is performed. This decimation 

operation reduces the number of data values by a factor of two by selecting every other coefficient. 
Because half of the coefficients are discarded, it is advantageous to skip the operations that produce 
these values. Therefore, when the convolution is performed, only every other data point is calculated 
and the decimation is performed at the same time. In the above example, after decimation, the result 

25 would be as follows: A2+B1+C2, A4+B3+C2, A6+B5+C4, A8+B7+ C6. 

Referring to FIG. 5, the forward wavelet transform performs the low pass and high pass 
functions first on the rows, and then the columns of the image data. The low pass decimation is 
performed by retaining only the odd numbered elements, then the high pass decimation is performed 
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by keeping only the even numbered elements. After performing both the row and column operations, 
there will be four frequency subbands. Each of these subbands will have one- fourth the elements of 
the original data set. This is because of decimation after both the row and column operations. 

FIGS. 4-6 illustrates the steps to perform the forward wavelet transform 22 and inverse 
5 wavelet transform 46. The mirrored image, input signal, is first convolved with a low pass filter, 
then decimated. The mirrored image is also convolved with a high pass filter and decimated. This 
procedure is repeated with each of the results of this stage. This produces a one level wavelet 
transform. To create more wavelet levels, the low-low pass quadrant is used as the new input, and 
the procedure is repeated. As shown in FIG. 4, there are five wavelet levels preformed that produces 

10 16 subbands. It is appreciated that less than five or more than five levels may be performed and are 
considered part of the invention. 

The step of inverse wavelet transform 46 is the reverse process of the forward wavelet step. 
First, column interpolation is performed on the input signals by inserting zeros into the data. Then, 
the signals are convolved with the low pass and high pass filters, and then the results are summed. 

15 Row interpolation is performed, and then low pass and high pass filtering is performed, and the 
results are summed. This is performed for one level of the inverse wavelet transform. For additional 
levels, this result is treated as a low-low pass subband, and combined with the next level high-low, 
low-high, high-high, subbands. 

In the embodiment shown, the decimation and interpolation steps are included in the forward 

20 and inverse convolution thereby reducing the number of multiply and adds required. The decimation 
procedure is performed by skipping the multiply and adds in pixel locations that would be discarded 
to decimation. The interpolation procedure is performed by dropping some of the filter coefficients 
due to the convolution taking place on an interpolated data set that has zeros for every other value. 
This arrangement advantageously offers much computational savings. 

25 Referring to FIGS. 2 and 7, quantization step 24 includes the process of converting a 

continuous signal into a discrete, digital representation of that signal. Choosing how to map the 
continuous signal to a discrete signal depends on the application of the image use. In one 
embodiment, for low distortion and high quality reconstruction, the quantization step 24 is 
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constructed in such a way to take advantage of the signal's characteristics. Scalar quantization is 
herein defined that each wavelet coefficient is quantized separately, one at a time. Uniform 
quantization is herein defined that the structure of the quantized data is similar to the original data. 
In general, quantizing wavelet coefficients for image compression involves a compromise between 
5 low signal distortion, and the ability to produce quantized coefficients that are easily compressed. 
The method of the present invention advantageously employs the probability distribution of the 
wavelet coefficients to enables such high compression of the image. 

The method of compression includes uniform scalar quantization. The resolution of 
quantization is controlled by user-defined parameter CR. The larger the parameter, the more 

1 0 coarsely the wavelet coefficients are quantized. This coarse quantization includes more distortion to 
the signal, but allows higher compression of the data because the dynamic range of the quantized 
coefficients is reduced. Quantization step 24 used in compression method of the present invention is 
a uniform quantization with a dead zone centered at zero. This means that the zero bin is twice the 
size of all other bins. The uniform quantization is performed by first dividing the wavelet 

15 coefficients by user-defined parameter CR. The new values are truncated to produce a rounding 
towards zero effect, regardless if the value is positive or negative. 

Referring to FIG. 2, de-quantize step 44 is preformed after step 42 of unpacking the data. 
First, 0.5 is added to the quantized wavelet coefficient value. This shifting of the quantized value to 
the middle of the bin is used to prevent excessive rounding towards zero. This shifted value is then 

20 multiplied by user-defined parameter CR. In de-quantize step 44, if the quantized coefficient value 
is zero, then the value after de-quantization will also be zero. 

In one embodiment of the invention, datapack step 26 includes several methods that are 
applied to different wavelet transform levels. FIG. 3 is a flow chart of an embodiment of a method 
of datapack step 26. At step SI 00, the image data is received. At step SI 02, adaptive run length 

25 coding is applied for three subbands (low-high, high-low, and high-high) at wavelet transform level 
one. At Step S 1 04, for additional wavelet transform levels, a two-knob Huffman data pack method is 
performed. At step SI 06, a low frequency data packing method is used for the low frequency 
subband of the highest wavelet level. Each of the datapacking methods will be described in detail 
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herein. 

Referring to FIGS. 8-12, a method of using adaptive run length coding is illustrated. The 
adaptive run length coding method includes Huffman codes with run-length coding of zero 
coefficients. At step 300 and 302, the adaptive run length coding process is started in which the 
5 wavelet coefficient input data is scanned to find the largest coefficient magnitude. The number of 
bits needed to represent this coefficient is then written to the header as a four-bit word. 

At step 304, run-length coding on the zero coefficients is performed on the input data. The 
output of this run-length coding is a data stream consisting of non-zero coefficients, and zero plus 
run words. A run is herein defined as the number of zeros following the first zero of a sequence of 

1 0 sequential zeros. A sequence of just one zero is coded as a zero followed by a zero. A sequence of 
ten zeros, would result in a zero followed by a nine. Referring to FIGS. 9A and 9B, the input data 
shown in FIG. 9A includes three runs- runl having eight zeros, run2 having 5 zeros, and run3 having 
1 0 zeros. As shown in FIG. 9B, the output data for runl includes a zero indicator and a run word of 
7. The second run or run2 includes the zero indicators and a run word of 4. Shown in FIG. 12B is a 

15 representation of the data structure after run length coding in step 304. The zero indicator D24 is 
followed by run word D26. Run word D26 is followed by a non-zero wavelet coefficient D28. 
Referring to step 306, shown in FIG. 8, the output data is further compressed using a Huffman table 
as illustrated in FIG. 10. 

FIG. 1 1 illustrates the data packing method of step 306 and FIG. 12A shows a data structure 

20 for encoding the runs in the output data. This data structure will be explained in the foregoing 
detailed description. Referring to FIG. 1 1 , at step 400, when writing a coefficient not found in the 
Huffman table, an escape code is written first to a compressed datastream. Then, the coefficient is 
written in signed bit representation using the number of bits determined earlier in step 302. At step 
402, during the encoding, if a zero is encountered, the zero is encoded first. At step 404, following 

25 the zero, three bits are allocated to writing the run. Then at step 406, the number bits needed to 
represent the run is determined. If three bits are not enough to write the run-length, then following 
these three bits, another zero is written then control is transferred to step 404. The following three 
bits are then allocated to writing the run-length. The procedure is continued until the run-length has 
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been written. Then at step 408, after the run-length, the next word is the next non-zero coefficient 
and that next non-zero coefficient is encoded. In this coding scheme, a run is at least one in length. 
Therefore an encoded run is the actual run length minus 1 . Given the processing speed of the 
computer equipment implementing the method, it is appreciated that more than three bits may be 
5 used to writing the run or equally less than three bits made be allocated to write the run both are part 
of the invention. 

FIG. 12A illustrates a representation of the data structure after encoding a run in the present 
invention. There is a header element that provides information on the number of bits of the largest 
coefficient. Adjacent thereto may be a zero indicator element D10 representing that start to a zero 

10 ran. The next data objects D12, D14, D16 are the components that make an exponential 
representation wherein sum of D 12, D 14, and D 16 equals powers of two. Thus, if a run length of 8 
needs to be encoded then data objects D12, D14, D 1 6 would have a one therein and the sum of D12, 
D14, D16 would be equaled to three. If the run length is greater than eight then the sequence is 
repeated. Zero indicator element D10 is again encoded and data objects Dl 8 5 D20, D22 are encoded 

15 accordingly. 

The following is an example of how a run of 256 zeros is encoded - 011101110110. In 
other words, each bit representation represents 2 raised that power. In the example of 256 zeros, the 
representation would be stated as 2 1+1+1+1+1+1+1+1 or equivalently 2 s . 

Referring to FIG. 13, a flowchart of a datapacking method of present invention is shown 
20 including a two-knob Huffman coding algorithm. This data packing method is performed on the 
quantized wavelet coefficients on the high-low, low-high, and high-high quadrants of all wavelet 
levels other than the first level. This form of data packing is a lossless compression function that 
uses run-length and Huffman coding techniques. 

At step 500, the dynamic range of the data is determined. The number of bits needed to code 
25 the largest valued data coefficient is then written to the header part of the data stream. As shown in 
step 502, the datapacking method includes performing run-length coding of the zero coefficients as 
was explained in step 304 shown in FIG. 8. The output of this run-length coding is a data stream 
consisting of non-zero coefficients, and zero plus run words. After performing the run-length coding, 
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the function determines the longest run of zeros and the number of bits needed to code this run. This 
number is written to the header part of the data stream. 

Then at step 504, the datastream generated from the run-length encoding is then compressed 
using a Huffman code. As shown in FIG. 14, a predetermined table including the Huffman code 
5 (two-knob code) has words for the most frequently occurring symbols. At step 504, encoding the 
non-zero coefficients is a lookup procedure based on using the Huffman code with the predetermined 
table. If the symbol does not have a Huffman code, then an escape code is used to code this symbol. 
After the escape code, a signed bit representation of the coefficient is sent to the output data stream. 
The number of bits used to code this coefficient was determined earlier by scanning the data set and 

1 0 writing the number of bits needed to the header in step 500. 

The zero-runs are encoded by using a zero code word, followed by a run code word. 
Encoding the runs is a little more complex because there are two Huffman words used to represent 
two types of runs. The two types of runs are designated herein as "long" or "short". A run is herein 
considered to be "long" if its length has to be coded using more than half the number of bits allotted 

1 5 for the longest run. A run is herein considered to be "short", if the run can be coded using half the 
number of bits or less allotted for the longest run. Once the encoder 20 determines if the run is 
"long" or "short", then the corresponding zero Huffman word is sent to the data stream. After this 
word, then the run is coded using the appropriate amount of bits. 

In datapack step 26, a low frequency data packing method is used to compress the low-low 

20 pass quantized wavelet coefficients of the highest wavelet level. FIG. 15 shows a low-low pass 
subband with an indicated labeling convention. Referring to FIG. 1 5, the encoding process starts by 
locating the largest DC coefficient (DC1 - DC9). The number of bits needed to code that value is 
determined. This bit size is written to the header as a four-bit word, and then the DC coefficients are 
written in unsigned bit representation using the determined number of bits. 

25 FIG. 1 6 illustrates the encoding process in a row- wise manner. The differences between the 

coefficients in the same row are calculated. These values will be the differences between ACi and 
DC1, ACi2 and AQ1, ... , ACin and ACin-1. The maximum difference value is then located, and 
the number of bits needed to code this difference is written to the header as a four-bit word. Next, the 
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differences between the coefficients in the same row are written using signed bit representation using 
the determined number of bits. This data is coded by storing the DC coefficients, and then the 
difference values for the remaining coefficients across the rows. 

FIG. 17 illustrates test data of a chart the Peak Signal-to-Noise Ratio (PSNR) values for 
5 different compression levels. The Peak Signal-to-Noise Ratio is the most commonly used metric of 
image quality in image and video compression literature. The test image was a high resolution 
picture suitable for publication in a magazine. The test image was 24-bit color, 1 .86 x 2.79 inches at 
1 100 dots per inch (dpi). The data file size was 18.9 Mbytes. JPEG compression was compared 
against the apparatus and methods in accordance with the present invention. JPEG compression was 

10 limited to a compression level of 178:1, as a result, the blocking effect was very noticeable, and 
image quality was inferior to the present invention. 

The PSNR uses the Mean Squared Error for the mean of the sum of the squares of the 
differences between the values of pixels in two images. The formula for the MSE is as follows MSE 
= 1/ n * (Sum(i)(j)|P(i)C)-Q(i)0)l A 2) where P is an original image and Q is the reconstructed one; i 

1 5 and j are the horizontal and locations of a pixel; P(i)G) is the value of the pixel location (i)(j); and n 
is the total number of pixels for the image. The Root Mean Square Error is the square root of the 
Mean Squared Error in which RMSE - Sqrt (MSE). 

Finally, the mathematical formula for the PSNR is as follows: PSNR = 10 Logi 0 (b/RSME) 
where b is the peak value for a pixel, typically 255. The PSNR is usually quoted in decibels, a 

20 logarithmic scale. PSNR is commonly used despite that it has a limited, approximate relationship 
with perceived errors notices by the human visual system. In general, the higher the PSNR, the 
higher the quality of an image. Note however, there are certain cases which can increase the PSNR 
without increasing the perceived image quality. An example of this is uniform image backgrounds 
that do not add any resolution to the region of interest. 

25 FIGS. 18A-8C illustrate the visual quality of an image at different compression levels 

between conventional JPEG and the present invention. FIG. 1 8B was compressed and decompressed 
using JPEG at a compression level of 77: 1. FIG. 1 8C was compressed and decompressed using the 
apparatus and methods of the present invention at a compression level of 162:1. FIGS. 19 A, 19B, 
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19C illustrate the visual quality of an image at almost the same compression levels between JPEG 
and the present invention. FIG. 19B was compressed and decompressed using JPEG at a 
compression level of 1 63 : L Here the blocking effect is very noticeable. FIG. 19C was compressed 
and decompressed using the apparatus and methods of the present invention at a compression level 
5 of 162:1. 

It is apparent the embodiment presented herein may be implemented on a general purpose 
computer in the form of a conventional personal computer, including a central processing unit, a 
system memory, and a system bus that couples various system components including the system 
memory to the central processing unit. The system bus may be any of several types of bus structures 

10 including a memory bus or memory controller, a peripheral bus, and a local bus using any of a 
variety of bus architectures. The system memory includes read only memory ("ROM") and random 
access memory ("RAM"). 

It is appreciated that present invention can process digital image data for video applications. 
The method can be employed in real time video capture systems using real-time or off-line 

1 5 compression. In one embodiment, the method is applied in a video system capturing a sequential 
series of still images. These images can be considered frames as similarly founded in motion picture 
film. Color images consist of three color components, a red component, R, a green component G, 
and a blue component, B. To process color images efficiently these components can be transformed 
into luminance, Y, and chrominance, U and V, components by using the following color 

20 transformation equations. 

Y = R * 0.299 + G * 0.587 + B * 0.114 
U = -R * 0.169 - G * 0.332 + B * 0.081 

V = R * 0.500 - G * 0.419 - B * 0.081 

This transformation is typical in color image compression because the human visual system is 
25 more sensitive to the luminance component than the chrominance components. Therefore, one 
embodiment of the present invention can reduce the resolution of the chrominance components by 
decimation of a factor of two without loss of image quality. This results in the image being 
processed with full resolution luminance, and quarter sized chrominance, or 4:2:0 format. This 



16 



advantageously, reduces the storage capacity and higher bandwidths for transmitting digital video 
data. 

The methods can be embodied a computer program product in a computer usable medium, 
such as a floppy drive, optical disk, or magnetic hardware drive. The computer usable medium 
5 includes computer readable code that causes a computer to execute a series of steps. The computer 
readable code may be stored on a server connected to a public switched telecommunication network, 
such as the Internet including the World Wide Web. 

While the invention has been describes with reference to embodiments, it will be understood 
by those skilled in the art that various changes may be made and equivalents may be substituted for 
1 0 elements thereof without departing from the scope of the invention. In addition, many modifications 
may be made to adapt a particular situation to the teachings of the invention without departing from 
the scope thereof. Therefore, it is intended that the invention not be limited to the particular 
embodiment disclosed, but that the invention will include all embodiments falling within the scope 
of the appended claims. 

15 



17 



